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Abstract 

“Guess Who?” is a popular two player game where players ask “Yes”/“No” questions to search for 
their opponent’s secret identity from a pool of possible candidates. This is modeled as a simple stochastic 
game. Using this model, the optimal strategy is explicitly found. Contrary to popular belief, performing 
a binary search is not always optimal. Instead, the optimal strategy for the player who trails is to make 
certain bold plays in an attempt catch up. This is discovered by first analyzing a continuous version of 
the game where players play indehnitely and the winner is never decided after finitely many rounds. 


1 Introduction 


“Guess Who?” is a zero-sum two player game where players take turns asking “Yes” or “No” questions to find 
their opponent’s secret identity [T2]. Each player keeps track of a finite pool of possible candidates for their 
opponent’s secret identity. Players alternate turns asking a “Yes” or “No” question (that their opponent must 
answer truthfully) about their opponent’s identity and reduce their pool of possible candidates. For example, 
if Player 1 asks: “Does your character have blue eyes?” and Player 2 answers “No”, Player 1 eliminates all 
candidates in his pool that have blue eyes. Eventually, only one candidate remains in the pool and this last 
character must be their opponent’s secret identity! The first player to narrow their pool down to a single 
character in this way wins the game. 

To model this game mathematically we will make the following assumptions: 

• The secret identity of the opponent is uniformly distributed amongst all possible candidates. Because 
of the eliminating nature of the “Yes”/“No” questions, this property persists throughout the game. With 
this assumption, only the number of remaining characters in the pool is relevant to the analysis, not 
the details of which characters in particular are remaining. 

• For any candidate pool size n and any l<6<n—1, itis always possible to construct a question for 
which exactly b candidates correspond to the “Yes” answer. One way to do this is to sort the candidates 

alphabetically and ask “Does your character’s name come alphabetically before_?” where the name 

is chosen to be the name which is 6-th on the list. We use the terminology a “bid of size 6” for such 
a “Yes”/“No” question. 

With these assumptions, “Guess Who?” can be modeled as a so called simple stochastic game as defined 
by Gondon [2]. The strategy of the game is in choosing the bid size. Players can balance risk vs. reward on 
each turn by varying their bid size. The bid 6 = 1 is a risky play: it gives a small chance to win the game 
immediately but is more likely to reduce the candidate pool by only 1. In contrast, a bid of 6 = is the 
least risky bid. The main result of this article is to find the optimal bidding strategy for “Guess Who” and 
prove that it is optimal: 

Theorem 1.1. (Optimal Strategy and Optimal Probabilities for “Guess Who?”) When it is Player I’s turn, 
if Player 1 has n candidates in their pool and Player 2 has m candidates in their pool, then Player 1 has the 
following optimal strategy: 

• If n> 2'=+! -k 1 while 2^ + 1 < m < 2'”+^ for some k S N U {0}, then Player 1 is in the weeds and 

must make a bold move to catch up! Their optimal play is a bid of b*{n,m) = 2^ = and 

the probability Player 1 wins if both players play optimally is: 


p*(n, m) 


2k+i 2 2^'=+! -k 1 

n 3 nm 


( 1 . 1 ) 
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• If2^ + l<n< 2*+^ while m > 2^ + 1 for some fc € N U {0}, then Player 1 has the upper hand and can 
stay ahead by making a low risk play! Their optimal play is a bid ofb*{n, m) = and the probability 

Player 1 wins if both players play optimally is: 


X , 2^= 2 22fc + 

p (n,m) = 1 -h -- 

m o nm 


( 1 . 2 ) 


The proof of Theorem |1.1| is provided in Section]^ and goes by solving a recurrence relation that p*{n, m) 
satisfies. Figure [^shows a plot of Player I’s probability of winning as a function of the pool sizes {n,m). 



Figure 1: The probability p*{n,m) of Player 1 winning “Guess Who?” on their turn when Player 1 has n 
candidates in their pool and Player 2 has m candidates in their pool 


1.1 Bold Plays 

It is an elementary entropy calculation that the safe bid b — \_\n\ will minimize the expected number of 
“Yes”/”No” questions for Player 1. This is not the optimal strategy in “Guess Who?” because Player 1 does 
not want to minimize this expected value: instead he wants to maximize the probability of getting there 
before Player 2 does. This race against the opponent is what drives the optimal bidding behavior when 
Player 1 is significantly behind his opponent. When this happens, Player I’s optimal strategy is a bold bid, 
b = 2 L*°S 2 (™-i)J which depends only on Player 2’s (!) remaining pool, and is always strictly < \_^n\. This 
has a low probability of success (always strictly< i) but would put Player 1 back in the running if he were 
lucky. 

The concept of risky plays with big payoffs in stochastic games has a rich history. A classic example is the 
single player casino game red-and-black when playing against a subfair casino studied by Dubins and Savage 
|3]. There is also a two player version of red-and-black, where players try to bankrupt each other, which has 
also been studied by Secchi [S], and adapted by Pontiggia [7]. Still more variations are studied more recently 
by Ghen and Hsiau [T]. In all these examples, the authors find when bold plays are optimal. 

Another example, more similar to “Guess Who?”, is the two player dice game “Pig” where players race to 
100 points while managing risk vs. reward on each of their turns. Without the complication of racing against 
the opponent, Haigh and Roters [5] are able to conduct exact analysis. A game-show version where the 
players play simultaneously has also been studied by m- However, the interplay between two racing players 
complicates things. Neller and Presser [5] have numerically calculated the optimal strategy in situations 


2 



































































































where both players have < 100 points using dynamical programming techniques, but there is no known 
simple description of the optimal strategy like we have for “Guess Who?”. The optimal strategy to “Guess 
Who?” in Theorem o is satisfying because we can find a simple exact solution on how and when to play 
boldly. 

1.2 Log Periodicity and “Continuous Guess Who?” 

The landscape of the game exhibits a log periodic behavior. In particular for fixed c > 0 , p*{n, cn) does not 
converge as n —>■ oo. Instead it approaches a function which is periodic in log 2 (n). This kind of behavior is 
not uncommon in this kind of stochastic system, for example see group Russian roulette studied by van de 
Brug et al. m or ties amongst i.i.d. random variables studied by Eisenberg et al. 0. In “Guess Who?” 
there is an asymptotic function x ^ [0,1] which is exactly log 2 -periodic in the sense that 

Ploi‘^x,2y) =p*^{x,y) and for which p*{n,m) =p^(n,m) + O (^). 

This function p^ was first discovered as the probability that Player 1 wins a generalized game called 
“Gontinuous Guess Who?” which is introduced and studied in Section This game is more straightforward to 
analyze than “Guess Who?” because there are no small number effects. Figurej^shows a plot of p*^{2^a, 2^/?) 
when (a, /?) is in the L shaped region (2, oo) x (1, 2] 1J(1, 2] x (1, oo). The landscape of p*^ on all of x 
can be understood by tiling scaled copies of this L shaped region onto the entire quadrant. 



a 


Figure 2: The probability (2^0;, 2^/3) of Player 1 winning “Gontinuous Guess Who?” from the position 
X = 2^a, y = 2^/3 in the region {a, (3) S (2,oo) x (1,2] and (q;,/3) G (1,2] x (l,oo). This closely approximates 
“Guess Who?” when n,m are large: p*{n,m) = p^{n,m) + 0{:^). 

This log-periodicity means that if both players start with the same pool size, no matter how large. Player 
1 will always have a significant first turn advantage. If he plays optimally he will win with a probability 
between | and |, depending on the particular starting size. It would be more fair to start Player 1 with a 
larger pool of candidates to offset his advantage from playing first. The fair way to do this is enlarge his 
starting pool by a factor between | and |. 

1.3 Real World Applications 

Even though “Guess Who?” is an abstract game, the key features of the game are more broadly applicable: 

• Players are racing to reach a specific goal and get nothing for a second place finish. 

• The progress of both players toward the goal is public knowledge. 

• Players have the ability to balance risk vs reward on the speed of progress toward the goal. 
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In terms of these features, “Guess Who?” can be thought of as a toy model of some situations where two firms 
are competing to be the first to bring a product to market and have the ability to manage risk vs reward on 
the time it takes for product development. Our analysis suggests that the leading firm should take no risks 
(i.e. try only to minimize expected development time) and the trailing firm should take the smallest risk 
possible catch up to the leading firm. 

An example from history is the space race between the US and Soviet Union. After Apollo 8 became the 
first manned mission to orbit the Moon in December 1968, the Soviet Union went forward with a launch of 
its N1 rocket to attempt lunar orbit in February 1969 despite warnings from engineers on the low probability 
of mission success. The rocket catastrophically failed, but nevertheless the Soviet Union quickly scrambled 
for another attempt in July 1969, just 13 days before the launch of Apollo 11. The rocket exploded only 23 
seconds after launch and resulted in the largest non-nuclear man-made explosion of all time. Clearly, the 
Soviet Union was trying to increase the speed of development at the price of increased risk. They knew that 
taking risks was the best way to maximize the chance of catching up. 


2 The Model 


Definition 2.1. Mathematician’s “Guess Who” is a game between two players played on the statespace: 


S = {{n,m,Pi) : n S N, m S N, i £ {1, 2}} . 


( 2 . 1 ) 


The first entry indicates the number of people remaining in Player I’s pool of possible candidates for Player 
2’s mystery character. (This is the number of characters who have not yet been removed from consideration 
through previous questions.) The second entry indicates the same thing for Player 2. The last token, either 
Pi or P 2 indicates whose turn it is to play. Players alternate turns. 

If the last token is Pi, it is Player I’s turn. On his turn, from the state (n,m. Pi), Player 1 makes a bid 
61 £ [l,n — 1] nN. This represents a “Yes-No” question that Player 1 may ask Player 2 about his hidden 
character. bi represents the size of the set of “Yes” answers. Once Player 1 has selected his bid, the game 
state evolves stochastically according to the following rules: 


Starting at {n,m,Pi) with Player 1 bidding bi 


{bi,m,P2) 

{n - bi,m,P 2 ) 


with probability ^ 
with probability 


This reflects the fact that Player 2’s mystery character is uniformly distributed in the pool of characters. 
From this state, it is Player 2’s turn. Player 2 makes a bid 62 £ [1, to — 1] O N, and the state space evolves in 
the same way: 


From {n',m,P 2 ) with Player 2 bidding 62 


in', b, Pi) 

(n', TO- 62 , Pi) 


with probability ^ 
with probability 


All random chances in this game are independent of one another. Afterward it is Player I’s turn again and 
play repeats. Players continue in this way until either player reduces his pool of candidates to exactly 1. 
When this happens that player immediately winsj^ That is to say that the following states are terminal 
states for the game: 


Vto > 1, (1,TO, P 2 ) ■£>■ Player 1 immediately wins 
Vn > 1, (n, l,Pi) £->■ Player 2 immediately wins. 

The state (1,1, Pi) is undefined because there is no way to reach this state without first passing through one 
of the previously defined terminal states. 

Remark 2.2. Definition |2.1| puts the game of “Guess Who?” in the framework of a Simple Stochastic Game 
as defined by Gondon [5]. These are also sometimes called 2Uplayer games, where in addition to Player 1 
and Player 2, the randomness in the game is thought of as | of a player. Because of the randomness inherent 
in these games there is no strategy for either player that guarantees victory. (Indeed, in “Guess Who?”, the 
opponent can always make a bid of 1 , and if they are lucky, will immediately win.) 

^In some versions of “Guess Who?”, players are required to make an additional “special” final guess, even if they have already 
reduced the pool to a single individual. We do not consider this rule set here. 
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Definition 2.3. The theory of simple stochastic games going back to Shapley jH] and Condon show the 
existence of an optimal bidding strategy we will denote by &* : N x N —)■ N and an optimal probability function 
we will denote by p* : N x N —>■ [0,1] that is optimal for Player 1 in the sense thaiQ 

• If Player 1 makes a bid of b*(n,m) whenever the state {n,m,Pi) is encountered, then no matter what 
strategy Player 2 chooses, Player 1 wins with probability > p*{n,m). 

• If Player 1 uses any other bidding strategy whatsoever at the state (n,m,Pi), then Player 2 has a 
strategy that ensures that Player 1 wins with probability < p*(n,m). 

When both players play their optimal strategies. Player 1 will win with probability exactly p*(n,m). 
Since “Guess Who?” is symmetric between Player 1 and Player 2 (in the sense that the position {n,m,Pi) 
is functionally identical to the position (to, n, P 2 )), the optimal bidding function for Player 2 from (n, to, P 2 ) 
is b*{m,n) and Player 2’s optimal probability to win from (n, to, P 2 ) is p*{m,n). By the same token. Player 
I’s probability to win from {n,m,P 2 ) is 1 — p*{m,n) since Player 1 wins if and only if Player 2 loses. Both 
players strategies and bids can be encapsulated in a single function. Without loss of generality then, we will 
thus normally take the point of view of Player 1 in our analysis. Because of this symmetry, the optimal 
probability function satisfies a nice recurrence relation: 

Proposition 2.4. p*(n,m) and b*{n,m) satisfy the following recurrence relation: 

p*{n,m) = max i 1 - p*{m,b) - p*(m,n — b'\ 

hG[l,n-l] ( n n J 

I h Ti — h 

b*(n,m) = arg max <1 - p*(m,b) - p*(m,n — b) 

f)e[i.n-i] ( n n 

(For the case the argmax is not unique: any b that maximizes the function will work for Player Vs optimal 
strategy) 

Proof. Let pb(n,m) be the probability that Player 1 wins from the position {n,m,Pi) if he bids b at 
(n, TO, Pi) and both players play optimally thereafter. By the rules of the game Pb{n, to) = ^ (1 — p*{m, n)) + 
(1 — p*{m, n — b)) since with probability ^ we move to the position {b, to, P 2 ) where Player I’s probabil¬ 
ity to win is 1 — p*{m, n) and with probability we move to the position (n — &, to, P 2 ) where Player I’s 
probability to win is 1 — p*(rn, n — b). 

It must be that p*{n,m) = max^gjj^ „_ 2 ] p&(n, to) since Player 1 must bid some value of b at the position 
(n, TO, Pi), and the optimal strategy is no worse than any fixed bid b. Similarly, any b* that has pb* (n, to) = 
p*(n,m) means that bidding b* at the position {n,m,Pi) is an optimal bid. □ 

Remark 2.5. “Guess Who?” has a nice structure because the sum s = n -|- to is a strictly decreasing function 
as the game progresses. Indeed, this sum is guaranteed to decrease by at least 1 on each players turn. This 
observation, along with the initial data Vto > 1, p*{l,m) = 1 and Vn > 1, p*{n, 1) = 0, mean that p*(n,m), 
b*{n,m) can be explicitly computed inductively by the very simple Algorithm given belovj^ At each stage 
s, the algorithm has already computed p*{n, to) for all pairs (n, to) with n -I- to < s — 1. The values p*{m, b) 
and p*{m, n — b) which appear in the maximization have to -I- 6 < s — 1 and m + {n — b) since 1 < & < n — 1 
and TO -|- n = s, and are hence already known. The proof of the main Theorem 1 1.1 1 also uses this observation 
as part of a proof by induction. The output when r = 64 is displayed in Figure for 1 < n,m < 32. 


( 2 . 2 ) 

(2.3) 


Algorithm 1 Numerically computing p*(n, to) and b*{n,m). 

Input: r > 3 Output: The value p*{n, to) and b*(n, to) for all pairs (n, to) with 3 < n -I- to < r 
0. Initialize p*(l, to) = 1 for all to > 1 and p*{n, 1) = 0 for all n > 1 

1. for s = 3 to r: 

2 . for ( n, to) with n -|- TO = s: 

3. p*{n,m)^maxbe[i,n-i] {l - ^P*{m,b) - ^p*{m,n-b)} 

4. b*{n,m) ^ argmax^gj^ {l - ^p*{m,b) - ^p*im,n-b)} 


^This function p* (n, m) is sometimes called the value of the node (n, m, Pi) and the strategy b* is called a positional strategy 
since it depends only on the current position. 

^Note that for some {n,m) the value b*(n,m) is not unique and will depend on how argmax is implemented. 
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2.1 “In the weeds” and “the upper hand” 

We now divide the statespace S into some subsets which turn out to be relevant for discussing the optimal 
strategy in the game. Except for the trivial states where n = 1 or m = 1, these subsets partition the entire 
statespace S in the sense that each element of the statespace is in exactly one of the subsets. This division 
was discovered by careful examination of the output of Algorithm 

Definition 2.6. For fc £ N U {0}, define the set Wk,p^ C S by: 

Wk,P, := {(n,m,Pi} : 2^+^ < n and 2^ < m < 2^+^} . (2.4) 

When (n, m, Pi) £ Wk,Pi, we say that Player 1 is in the weeds at level k, or if {n, m, Pi) Gur=o fPfc,PiWe 
say Player 1 is in the weeds without specifying which level. Similarly, Wk,P 2 '■= 

P 2 ) ■ 2 '^'^^ < m and 2^ < n < and we use the same terminology for Player 2. 

Definition 2.7. For fc £ N U {0}, define the set Uk,Pi C S by: 

Uk,Pi ■= {{n,m,Pi) : 2^ < n < 2^'^^ and 2^ < m} . (2.5) 

When {n,m,Pi) £ Uk,p^, we say that Player 1 has the upper hand at level k, or if {n,m,Pi) £ 
say Player 1 has the upper hand without specifying which level. Similarly, Uk,P 2 ■= 

{{n,m, P 2 ) : 2^ < m < 2^+^ and 2^ < n} and we use the same terminology for Player 2. 

With these two definitions solidified, the description of the optimal strategy for the game is very simple: 
players should bid 2^ when in the weeds at level k and should bid \_^n\ when they have the upper hand. 

3 “Continuous Guess Who?” 

The exact formula in Theorem |1.1| is proven by induction, which does does not shed much light on how the 
formula is discovered or what the individual terms in it represent. To aid understanding, we present the 
original method that led to the formula p*{n,m). From numerically computing p*(n,m) and b*{n,m) for 
some small values of n,m using Algorithm we get an ansatz on the optimal bidding strategy b*{n,m). 
We then exactly calculate the probability for either player to win when they follow this bidding ansatz in 
a modified version of “Guess Who?” which is defined in such a way to remove small number effects. The 

probability of winning in this modified game turns out to be a good approximation to the probability of 

winning in “Guess Who?” in the asymptotic regime when the pool sizes n, m are very large, and this is what 
first led to the formula p*{n, m). 

Definition 3.1. “Gontinuous Guess Who?” is a two player game on the statespace: 

S^ = {{x,y,P,) : a:£K>°, 2 /£K>°,z£ {1,2}}. (3.1) 

The partition of the statespace into “in the weeds” and “the upper hand” are generalized to this continuous 
setting by allowing any exponent fc £ Z: 

:= {(2'=a,2'=/3,Pi) : a £ ( 2 , 00 ) and ^ £ (1,2]} C Soo,fc £ Z, (3.2) 

U^P^ := {(2'=a,2'=/3,Pi) : a £ (1,2] and/3 £ ( 1 , 00 )} C Soo,fc £ Z. (3.3) 

W^p^, U^p^, fc £ Z are defined analogously. Play proceeds in a similar way to “Guess Who?”: on Player I’s 
turn, from the state {x, y, Pi), Player 1 is allowed to make any (not necessarily an integer) size bid bi £ (0, x) 
and the game state evolves stochastically according to: 


Starting at {x,y,Pi) with Player 1 bidding bi —)■ 


{bi,y,P2) 

{x-bi,y,P2) 


with probability ^ 
with probability 


On Player 2’s turn he makes a bid 62 S (0, y) and the game state evolves analogously. All of these random 
outcomes are independent. 

The game has no ending point: players continue playing indefinitely. Instead, the events (Player 1 loses}, 
(Player 2 loses}, (The game is a draw} are defined abstractly as events on the underlying probability space. 
The event that Player 1 loses is defined to be the event that Player 1 is in the weeds “almost always”; that is 
there exists some time beyond which Player 1 is trapped in the weeds forevermore. Player 2 losing is defined 
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analogously. The event that the game is a draw is defined to be the event that both players each alternate 
being in the weeds infinitely often. Denoting the state of the game after t steps as St G Soo, this is written: 


{Player 1 loses} 

{Player 2 loses} 

{The game is a draw} 
We define also {Player 2 wins} 


:= < BT such that > T, St G [J on Player I’s turns 

I, k——oo > 

{ CXD '' 

3T such that Vt > T, S'* G Player 2’s turns 

k——oo > 

:= \StG U i.o. and St G \J i.o. . 


k— — c 


k— — c 


:= {Player 1 loses} and {Player 1 wins} := {Player 2 loses}. 


Remark 3.2. “Continuous Guess Who?” is a very peculiar game because of the strange way that winning is 
defined. Indeed, the players cannot know if they have won or lost the game after finitely many rounds of the 
game. (There is always a positive probability the trailing player will recover and win, no matter how dire 
their situation may become.) Practically speaking, one should think of “Continuous Guess Who?” only as a 
tool that closely models ordinary “Guess Who?” when the length of the game is very long. In the remaining 
analysis in this section, we will show that when both players play according to an optimal bidding ansatz, 
the probabilities of winning from any state can be calculated exactly and the probability of a draw is 0. 

Definition 3.3 (Optimal Bidding Ansatz). We assume both players play according to the following strategy: 

• When Player 1 is in the weeds at level k, (2^0, 2^/3, Pi) G W^p^, Player 1 makes a bid of 2^. 

• When Player 1 has the upper hand at level k, {x,y,Pi) G U^p^, Player 1 makes a bid of ^x. 

Player 2 plays analogously. This ansatz was hypothesized by examining output from Algorithm [l] 

Definition 3.4. Define x —)■ (0,1) to be the probability that Player 1 wins starting from the 

position {x,y,Pi) on his turn when both players play according to the optimal bidding ansatz: 


p^{x, y) := P (Player 1 wins from (x, y. Pi)). 


(3.4) 


Lemma 3.5. For any j G Z, p^(x,y) satisfies: 

pIo{x, y) = pIo {2^x, 2^y) . (3.5) 

Proof. Define the map (f> : {x,y,Pi) —> (2?a;, 2??/, Pi) and we will show that the winning probability is 
invariant under the map (j). Notice firstly that (j) sends the sets W^p, -G W^j p. and P“p. —t p,. Thus 
this map has no effect on whether Player 1 is in the weeds or has the upper hand. We next observe that 
under the optimal bidding ansatz, the probability of a bid being a success is also invariant under the map f. 
Indeed, when Player 1 has the upper hand, he makes a no-risk bid and his pool size is plainly halved; when 
Player 1 is in the weeds at level k, (2^0, 2^/3, Pi) G kP^pp a > 2, j3 G (1,2], he makes a bid of size 2^ and 
his pool size after the bid is: 

J 2*^ with probability 

\2'^a - 2^ with probability ' 

The factor 2^ cancels, and this probability depends only on a, which is invariant under f). From the above 
considerations we see that if we let St be the state after t rounds started from (a;, y, Pi) and we let Rt to be 
the state after t rounds started from (2?a;, 2?p, Pi), then we have equality in distribution: 


Rt = <t>{St). 


From this invariance, along with the fact that 4> does not change whether or not Player 1 is in the weeds, 
and the very definition of Player 1 winning the game, we conclude that p^ is invariant under the map (j) as 
desired. □ 


Lemma 3.6. Assume both players play “Continuous Guess Who?” according to the optimal bidding ansatz. 
Suppose we start from the position (2^a, 2^/3, Pi) with a>2 and f3 G (1,2], so that Player 1 is in the weeds 
at level k G Z. Then, the probability that Player 1 ever exits the weeds at any point throughout the game is 
2 

a 
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Proof. Since Player 1 is in the weeds at level k, his first bid is 2*. This means that his pool size becomes: 


I 2'^a - 2^ 


with probability 


with probability 


2'=a-2'‘ 

2'=q 


If his pool size becomes 2^, the pool sizes are now x = 2^,y > 2^+^ meaning that Player 1 now has escaped 
the weeds and has the upper hand. Thus Player 1 escapes from the weeds on the first guess with probability 

2*= _ J_ 

2^a a ' 

If Player 1 fails to escape the weeds on this turn, his new pool size is 2^0 — 2^ = 2 ^~^ (2(q! — 1)). At 
this stage Player 2 has the upper hand, and will make a bid of exactly |2^/3 and so Player 2’s new pool 
size is halved and will be exactly 2^“^/3. Thus, if Player 1 fails to es cape the weeds at this level, the game 
state moves to cc = 2^~^ (2(a — 1)) and y = 2^“^/3. By Lemma 3.5 the probability of winning is invariant 


under changing k, and so this position is handled by the same analysis as above with a new a-value which 
is a' = 2(q; — 1). Player I’s probability to escape the weeds at this stage is ^ = 2 {a-i) ■ This analysis can 
be repeated over and over again. Finally, to find the probability Player 1 ever escapes the weeds, we sum up 
his probability to escape at each stage: 

P (Player 1 ever escapes the weeds) 

1 1 


= -+(l-- 
a V a 


+ 1 - 


,2 (a-l) V 2(a-l) 
Fortunately this expansion telescopes and we remain with: 


2 (2 (a-l)-l) 


P (Player 1 ever escapes the weeds) 
1 /a-l\ 1 fa-1 


a J 2{a — 1) 


a 

1 1 1 

-1“ X-1“ -h . . . . 

a 2a 4a 


2 (a- 1) - 1 

2 (a-l) 


1 


2(2(a-l)-l) 


+ . . . 


Evaluating this infinite sum gives the desired result of —. 


□ 


Next we prove a lemma about the position (2^ • 4,2^ • 2, Pi). This position is important because it is 
naturally encountered when both players play according to the optimal bidding ansatz. 

Lemma 3.7. Assume both players play “Continuous Guess Who?” according to the optimal bidding ansatz. 
Starting from the position (2^ • 4, 2^ • 2, Pi) (i.e. a = 4, /3 = 2), the probability Player 1 loses the game is | 
and the probability of a draw is 0. 

Proof. Player 1 can lose the game in one of two disjoint ways: 

a) Player 1 never gets out of the weeds. The probability that Player 1 never gets out of the weeds is 1 — | 
by Lemma |3.6[ 

b) Player 1 successfully gets out of the weeds at some point and puts Player 2 in the weeds; then Player 
2 himself gets out of the weeds; and finally Player 1 subsequently lo ses t he game from that position. The 
probability of this chain of events is calculated as follows: by Lemma 


3.6 


with probability |, Player 1 does 

get out of the weeds at some point, and the position will then be x = 2* • 2 and j/ = 2^ • 4 for some I, (i.e. 
the game evolves to a' = 2,/3' = 4). The position must be of this form because Player 2’s total is always a 
power of 2 up to this point (he started at a power of 2 and is always halving his pool up to this point) and 
Player I’s bid is the largest power of 2 which is strictly less than Player 2’s pool size. Fr om t his new position. 

If Player 2 does 


3.6 


the probability that Player 2 ever gets out of the weeds from this point is | by Lemma 
successfully get out of the weeds, the position will be a; = 2-1 • 4 and y = 2? -2 for some j by the same reasoning 
as above, (i.e. we the state has evolved back to the original position a" = 4, /3" = 2). Thus we have: 

P (PI loses from (2'= • 4,2^= • 2, Pi)) 

= P(P1 never escapes) + P (PI escapes) P (P2 escapes) P (Pl loses from ( 2 ^-4, 2 ^- 2 ,Pi)) 

= ~ 4^ T ^ ^ ■ P (P1 loses from (2-^ • 4, 2^ • 2, Pi)) 

= ^ (PI loses from {2^ ■ 4, V ■ 2, Pi)) . 















By Lemma IS.Sl the positions ^2^ • 4, 2^ • 2, Pi) and (^2^ ■ 4,2-^ • 2, Pi) are identical with regard to losing 
probability. Solving the linear equation p = ^ + gives p = as desired. The probability of a draw is 0 
because the probability of switching from one player being in the weeds to the other being in the weeds from 
this position has probability | each time, and so having infinity many switches is a probability 0 event by 
the Borel-Cantelli lemma. □ 


Proposition 3.8. Assume both players play “Continuous Guess Who?” according to the optimal bidding 
ansatz. Suppose we are at the position {x,y,Pi) = (2^0, 2^/3, Pi) with a> 2 and /3 G (1,2], so that Player 1 
is in the weeds at level fc € Z. Then the probability of a draw is 0 and the probability that Player 1 wins the 
game is: 

ofc+1 o o2fe+l 

pU^,y) = --■ ( 3 . 6 ) 

X o xy 

Proof. We first calculate the probability that Player 1 loses. As in Lemma [3.7| Player 1 can lose the game 
in one of two disjoint ways: 

a) Player 1 never gets out of the weeds. The probability that Player 1 never gets out of the weeds is 1 — - 
by Lemma |3.6[ 

b) Player 1 successfully gets out of the weeds at some point and puts Player 2 in the weeds; then Player 
2 himself gets out of the weeds; and finally player Player 1 subsequently loses the game from that position. 
If Player 1 does get out of the weeds, the position will be cc = 2^ • 2 and y = 2^ ■ (2/3) for some £, (i.e. the 
position evolves to a' = 2, /3' = 2/3). Thus the probability that Player 2 gets out of the weeds from this point 
is ^ by Lemma 3.6 If Player 2 does successfully get out of the weeds, the position will be x = 2^ • 4 and 


y = 2^ ■ 2 for some j, (i.e. we will be at a" = 4,/3" = 2). We have already analyzed this position in Lemma 
ITfl Thus we have: 

P (PI loses from (2'=a, 2'=/3, Pi)) 

= P (PI never escapes) + P (PI escapes) P (P2 escapes) P (Pl loses from ( 2 ^-4, 2 ^- 2 ,Pi)) 


= 1 - 


a 

2 

1 - 

a 


+ 


a 


2 

2P 


^ P (Pl loses from (2^ • 4, 2-' • 2, Pi)) 


a U/3y 3' 


The probability of a draw is 0 since if more than two changes of the upper hand occur, the game passes 
through the position (2^ ■ 4, 2? ■ 2, Pi), from which the probability of a draw is 0 by Lemma 
taking the complement and plugging in x = 2^a,y = 2^13 gives the result. 


3.7 


Finally then, 
□ 


Proposition 3.9. Assume both players play “Continuous Guess Who?” according to the optimal bidding 
ansatz. Suppose we are at the position {x,y,Pi) = (2^a, 2^/3, Pi) with a G (1,2] and j5 > 2, so that Player 1 
has the upper hand at level k G Z. Then, the probability of a draw is 0 and the probability that Player 1 wins 
the game is: 

Qk 2 2 ^^ 

plo{x,y) = ^ - h-—. (3.7) 

y 3 xy 

Proof. When Player 1 has the upper hand, he bids exactly ^2^0 and puts Player 2 in the weeds. By the 
symmetry between Player 1 and Player 2, we use Proposition |3 .8 1 to see the probability of a draw is 0 and to 
compute: 

P (Player 1 wins from state (2^a, 2^/3, Pi)) 

= P (Player 1 wins from state 2^“^ (2/3) ,P2)) 

= 1 — P (Player 2 wins from state (2*“^Q!, 2*^“^ (2/3), P2)) 

= 1 — P (Player 1 wins from state (2*“^ (2/3), 2 ^“^q;, Pi)) 


= 1 - ; 


2/3 2/3 \2aJ 3 


Plugging in x = 2^a,y = 2^/3 gives the result. 


□ 


Remark 3.10. Despite the difference between “Continuous Guess Who?” and ordinary “Guess Who?”, this 
analysis comes remarkably close to the true value p*{n, m) when evaluated at integer values. Indeed we have: 


p*{n,m) - pl^{n,m) = 


-|D_ if 
6 nm 


4 1 
3 nm 


n,m,Pi) G Wk,Pi 
if (n,TO, Pl) G Uk,p^ 


(3.8) 
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The corrections — | ^ and | ^ were found serendipitously by comparing numerical solutions of p* to the 
exact formula p^. The exact formula for p* was first hypothesized by adding these simple errors onto p^. 
A plot of p^ is provided in Figure]^ 


4 Proof of Theorem 11.11 

Having guessed at the formula using the above analysis, we now prove Theorem o rigorously. Define 
functions: 


qWk{n,m) := 


2/c+i 2 22'=+! + 1 

n 3 mn 

■)fc o o2fc 


2 '“ 2 2 ^'' + 2 

qu^{n,m) := 1-+ --. 

m 6 mn 

By stitching these functions together, these define the total function g(n,m): 


(4.1) 

(4.2) 


q{n, m) = 


'qWki'n^rn) \i {n,m,Pi) &Wk,p^ 
qUk {n,m) if {n,m,Pi) S Uk,Pi 
1 if n = 1 

0 if TO = 1 


(4.3) 


Lemma 4.1. Suppose that (n,m,Pi} G bFfe,Pi for some k. Then: 

QWtin-Trn) = max i 1 - q{m,b) - q(m,n — b) 

bG[l,n-l] ( n n 

and the maximum is achieved atb = 2^. 


(4.4) 


Proof. By symmetry between b and n — b, we assume without loss that 1 < 5 < [|nj. We will show that 
1 — ^q{m, b) — n — b)< qw^ {n, to) for all possible bids b with equality achieved at b = 2^. Consider 

the following cases: 

Case I: {b,m,P 2 ) G with 2^ < b < 2^+^ and (n — &, to,P 2 ) € C^fc.Pa- 

We begin by simplifying: 


1 - -qwe {m, b) - - - qu^ (to, n-b) 

n n 

^ 6 / 2^+1 2 22^+1 + 1 ^^ n-b 

n \ m 3 mb 


(4.5) 


n 


1 - 


2 2 ^'= 
+ X- 


2 fc 


n — b 3 m{n — b) 


1 - 


2 ^+] 


TO 


2k 2 2 ^^+! - 22 ^= - 1 

- 1 " Q 

n 3 mn 


Since {b,m,P 2 ) G we have 2^+^ < to here. It follows that for a fixed £, the above is a monotone 

increasing function of b. We thus obtain an inequality by replacing b by its maximum value b = with 

equality if and only if 6 = 2 ^+^: 


< 


1 - -qwiipi^b) - -— ^quk{m,n- b) 
n n 

2^+1 / 2 ^"^^ \ 2 ^ 2 _ 2 ^^_ 1 

—Q- 

n 3 mn 
2k 2 2 ^'= + 1 

n 3 mn 


n 

2^+1 


n 


1 - 

2 2^+1 
3 TO 


This is monotone increasing as a function of 2^+^ since 2^+^ < to. To see this, notice that if we increment 
£ by one, the difference is: 


P+i 


1 - — 2^+1 
3to 


- 2M 1- 2 ^ ) = 2M 1 - 

3to 


2i+i 


> 0 . 


Thus this function is maximized when £ takes its maximum value. We notice that we must have £ < k — 1, 
since {b,m,P 2 ) G Wg^p^ gives 2^+^ < to and (n, to, Pi) G Wk^p^ gives to < Thus this function is 
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maximized when £ = fc — 1, with equality if and only if £ = A: — 1. By inspection, plugging in £ = fc—1,6=2^ 
gives the maximum value is exactly (n, m) as desired. 

Case II: ( 6 , m, P 2 ) S Uk,P 2 and (n — b, m, P 2 ) S Uk^p^ ■ 

Consider here: 


< 


-qUk (w, b) - - - ^-qu^ (to, n-b) 

n n 

n — b 
n 

2 2^^ -I- 2 2^ 2 2^^ -|- 2 

0-1 --- 1 --- 

n o mn n d mn 
2k+i 2 22'=+i+4 

n 3 mn 


n \ 6 3 mb ) 


2k+i 2 22'=+! -t 1 

n 3 mn 


qWk{n,m). 


2k 2 2 ^^ + 2 
n — b 3 m{n — b) 


Interestingly, this does not depend on b at all, and is always worse than q^y^ (n, to) by ^. 

Case III: ( 6 ,to,P 2 ) G W^ 4 ,P 2 and (n - 6 ,to,P 2 ) G W^£„_i,.P 2 > where £h = [log 2 (&- 1)J in-b = [^og 2 {n -b-l)\. 
Assume without loss that £„_& > £b- As in Case I, because (n — 6 , to, P 2 ) G We^_^,P 2 and {n,m,Pi) G 
we must have £& < £„_;, < fc — 1. But then we see that there are no values of b that fall into this 
case at all, since we would have b < 2 ^*’+^ < 2 ^ and n — b < 2 ^"-'’ < 2 ^ which, by summing, is seen to be in 
contradiction to the fact that n > 2^^^ + 1 from (n, to, Pi) G Wk,p^- □ 


Lemma 4.2. Suppose that {n,m,Pi) G Uk^p^ for some k. Then: 


QUk (n, m) 


max <1 - q{m, b) 

be[i,n-i] ( n 


n — b 

- q{m, n 

n 



(4.6) 


and the maximum is achieved at b = . 

Proof. By symmetry between b and n — b, we assume without loss that 1 < & < • We will show that 

1 — ^q{m, b) — '^^^q{m, n — b)< qu^ (n, to) for all possible bids b with equality achieved at 6 = . Consider 

the following cases: 

Case I : ( 6 ,to,P 2 ) G Wp po with 2^ < b < 2^+^ and {n — b,m,P 2 ) G Uk,P 2 - 

for 1 — ^qwe (pi; b) — ^^^^qUk (pIj n — b) and show directly that the difference from 


We start with Eq. 


4.5 


qjj^ (n, to) is always positive. It is convenient to begin by multiplying out the common denominator: 


nm qu^ (n,m) - 1 - qwe (m, b) - 


n — b 


n 


= nm 


TO 3 mn 


- I - I 1- 

n 


n 
2 I +1 


QUk i'm, n-b) 


TO 


2k 2 2^^+! - 2^'= - 1 

H-h -- 

n 3 mn 


= nm-n2'^ + l (2^'= + 2) - 6 (to - 2^+^) - to2'= - ^ {2^^+^ - 2^'^ - l) 

O O 

= n (to - 2'=) - 6 (to - 2 ^+^) - to 2 '= - ^ 2 ^^+^ + ^ 2 ^'= + 2 . 

O O 


Now notice that since {n — b, to, P 2 ) G Uk,P 2 , it must be that n — b > 2^ and to > 2^. Thus the coefficient of 
n is positive, so using n > 2 *^ + 5 gives: 

> {2^ + b){m- 2 '=) -b{m- 2^+^) - to2 '= - + ^2'^'^ + 2 

o o 

= to2 '= + bm- 2^^ - 62'= - 6 to + 62^^+^ - to2 '= - ‘^2^^+^ + ^2^'= + 2 

o o 

= (2^= - 2^+1) --22^+1+ -2^'= +2. 

3 3 


We notice now that it must be the case that £ + 1 < fc, because {n — b, to, P 2 ) G Uk.P 2 gives 2'=+^ > to and 
( 6 , TO, P 2 ) G We^P 2 gives m > 2^+^. Hence the coefficient of b is non-positive, and we can use b < 2^+^ to get: 


> _2^+i (2^= - 2^+1) - -22^+1 + -2^'= + 2 

3 3 

= 222(,+i) _ ^e+i2k ^ l22fc ^ 2 . 

3 3 


11 






















Finally, since \ <k, define £ + 1 = fc — c, where c S N U {0} and then write the above expression as: 


= 2 ^^ I - 2~^ 


2 . 


By inspection, §2“^'^ — 2“'^ + | = 0 for c = 0 and c = 1 and is strictly positive for c > 2. Thus this expression 
is always > 0 and we have the desired inequality. 

Case II: (b, m, P 2 ) € and (n — 6 , 771 ,^ 2 ) € with 4 = [log 2 ( 6 — 1)J and£„_b = [log 2 (u — h— 1)J. 

We have in this case: 


h n — b 

1- qwu (m, b) - qw^ (m, n-b) 

n n 


= 1 - 

= 1 + 


b 2 2^‘’+i - 1 


_^/2C-b+i 2 


n \ m 3 mb J n \ m 3 m{n — b) 
4 2 

3nTO nm 


^ - ^2^'’^ 2^'’ + ^(u -b) - 2^ 


So it satisfies us to minimize the function that appears: 


2 ^ 4 ^ 2^0 + (^{n-b)- 


2*^ri-b I 


Since £& = [log 2(6 — 1)J , in-b = UoS 2 (^ ~ ^ ~ 1)J recognize this as f{b) + f{n—b) where / : N —>■ M is the 
function /(a) = (a — ) 2 L^°S 2 (“-i)J, But now we observe that this function is convex. Moreover 

this is a piecewise linear function and can be written as a maximum over a number of linear functions 
/(a) = supjgf^ (a — |2*) 2b Observe then that /( 6 ) + /(n — b)> )■ other words the bid 

b = \_^n\ cannot be beaten. When we plug in 6 = we get exactly the desired bound of qu^{n,m). 

Case III: (6, m,P 2 ) G {n — 6 , 771 ,^ 2 ) G Uk,P 2 

There are no such values of b since we would have 6 > 2^ + 1 and n — b > 2^' + 1 from the definition of 
Uk,P 2 , which by summing is seen to contradict n < 2^+^ from the fact that {n,m,Pi) G Uk,Pi- □ 


Remark 4.3. A careful reading of Case II of Lemma [4.2| reveals that for some values of n, there are other 
bids b that match the performance of b = \_^n\. Indeed, the function / is piecewise linear so when \_\n\ is 
on the interior of a linear segment, other bids that have both b and n — b on the same segment will do just 
as well. 

Theorem 4.4. (Restatement of Theorem dip ITe have that p*{n,m) = q{n,m) for all n,m and an optimal 
bidding is b*{n,m) = 2^ if {n,m, Pi) G Wk^p^ and b*{n,m) = [|nj when {n,m,Pi) G Uk,Pi- 


Proof. From Lemma |4.1| and Lemma 4.2 we know that q and p* both satisfy the same recursion relation 
from Proposition 2.4 We now prove that p*{n,m) = q{n,m) for all (n,m) by induction on the sum n + m. 
Let Ag = {(n, m) : n + m = s} and consider the statement that p*{n, m) = q{n, m) for every for all (n, m) G 
U 3 <s<rAs. The base case r = 3 is clear since p*(2,1) = q{2, 1) = 0 and p*(l, 2) = g(l, 2) = 1. Now suppose 
p*(n,m) = q{n,m) for every for all {n,m) G V3 < s < r. We have then for any (n,m) S As that: 


p*{n,m) = 


max 

feG[l,n—1] 


max 

6G[l,n— 1 ] 


b s., n — b., 

1- p (to, b) - p (to, n — b) 

n n 


1 — —q{m, b) — - — ^qim, n — b) 
n n 


(recurrence for p*) 

(by induction hypothesis) 


= q{n,m) (recurrence relation for q). 


The induction hypothesis can be applied since every pair (to, b), (to, n — b) has m + b < s — 1 and m + n — b < 
s — 1 since 1 < & < n — 1. By the principle of induction, this shows that p*(n,m) = q{n,m) for all 
(n,m) G U 3 <s<r.As and for all r > 3. But UrysAr covers the entire quadrant. Thus p*{n,m) = q{n,m) 


for all n, to as desired. In light of p* = q, Lemma 


4.1 


and Lemma 


4.2 


also show that b*(n,m) = 2 *^ for 
(77, TO, Pi) G bFfe,Pi and b*{n,m) = \_^n\ when {n,m,P 2 ) G Uk.p^ is an optimal bidding strategy. □ 
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